Search CORE

8 research outputs found

DEvIANT: Discovering Significant Exceptional (Dis-)Agreement Within Groups

Author: A Belfodil
AF Hayes
B Efron
B Ganter
B Ganter
D Eppstein
F Duris
F Lemmerich
GI Webb
H Grosskreutz
M Das
M van Leeuwen
S Geisser
S Minato
SO Kuznetsov
T Cover
W Duivesteijn
W Hämäläinen
W Hämäläinen
Publication venue: HAL CCSD
Publication date: 20/06/2019
Field of study

We strive to find contexts (i.e., subgroups of entities) under which exceptional (dis-)agreement occurs among a group of individuals , in any type of data featuring individuals (e.g., parliamentarians , customers) performing observable actions (e.g., votes, ratings) on entities (e.g., legislative procedures, movies). To this end, we introduce the problem of discovering statistically significant exceptional contextual intra-group agreement patterns. To handle the sparsity inherent to voting and rating data, we use Krippendorff's Alpha measure for assessing the agreement among individuals. We devise a branch-and-bound algorithm , named DEvIANT, to discover such patterns. DEvIANT exploits both closure operators and tight optimistic estimates. We derive analytic approximations for the confidence intervals (CIs) associated with patterns for a computationally efficient significance assessment. We prove that these approximate CIs are nested along specialization of patterns. This allows to incorporate pruning properties in DEvIANT to quickly discard non-significant patterns. Empirical study on several datasets demonstrates the efficiency and the usefulness of DEvIANT. Technical Report Associated with the ECML/PKDD 2019 Paper entitled: "DEvIANT: Discovering Significant Exceptional (Dis-)Agreement Within Groups"

Renal defects in KCNE1 knockout mice are mimicked by chromanol 293B in vivo: identification of a KCNE1-regulated K+ conductance in the proximal tubule

Author: A. M. Neal
Balloch
Barhanin
Barriere
Barriere
Bauer
Belfodil
Boini
Burg
Carlsson
Casimiro
Demolombe
Farkas
Fransen
Frohnert
H. C. Taylor
I. D. Millar
J. D. Kibble
Kibble
Kurokawa
L. Robson
Lerche
Liu
Lock
Maddox
Millar
Millar
Nicolas
Niemeyer
Perry
Pongs
Rubera
S. J. White
Sanguinetti
Schafer
Seebohm
Takumi
Trinh
Vallon
Vallon
Varro
Yang
Zheng
Publication venue: Blackwell Publishing Ltd
Publication date: 01/07/2011
Field of study

Crossref

PubMed Central

The University of Manchester - Institutional Repository

Elements About Exploratory, Knowledge-Based, Hybrid, and Explainable Knowledge Discovery

Author: A Belfodil
A Buzmakov
A Buzmakov
A Hristoskova
AA Bendimerad
B Ganter
B Ganter
B Ganter
C Carpineto
D Grissa
G Sourek
H Blockeel
Jilles Vreeken
JW Tukey
K Bertet
K Janowicz
M Alam
M Hilario
M Kaytoue
M Kaytoue
M Kaytoue
M Kaytoue
M Leeuwen
M Rouane-Hacene
MJ Zaki
N Lavrac
Omer Sagi
P Brazdil
P Eklund
P Nguyen
P-N Tan
Riccardo Guidotti
SN Tran
SO Kuznetsov
T De Bie
T Fawcett
TG Dietterich
V Codocedo
V Codocedo
V Duquenne
W Duivesteijn
W Ugarte
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 25/06/2019
Field of study

International audienceKnowledge Discovery in Databases (KDD) and especially pattern mining can be interpreted along several dimensions, namely data, knowledge, problem-solving and interactivity. These dimensions are not disconnected and have a direct impact on the quality, applicability, and efficiency of KDD. Accordingly, we discuss some objectives of KDD based on these dimensions, namely exploration, knowledge orientation, hybridization, and explanation. The data space and the pattern space can be explored in several ways, depending on specific evaluation functions and heuristics, possibly related to domain knowledge. Furthermore, numerical data are complex and supervised numerical machine learning methods are usually the best candidates for efficiently mining such data. However, the work and output of numerical methods are most of the time hard to understand, while symbolic methods are usually more intelligible. This calls for hybridization, combining numerical and symbolic mining methods to improve the applicability and interpretability of KDD. Moreover, suitable explanations about the operating models and possible subsequent decisions should complete KDD, and this is far from being the case at the moment. For illustrating these dimensions and objectives, we analyze a concrete case about the mining of biological data, where we characterize these dimensions and their connections. We also discuss dimensions and objectives in the framework of Formal Concept Analysis and we draw some perspectives for future research

Crossref

INRIA a CCSD electronic archive server

Optimal Subgroup Discovery in Purely Numerical Data

Author: A Belfodil
A Soulet
B Ganter
F Lemmerich
G Bosc
GC Garriga
H Grosskreutz
H Grosskreutz
H Grosskreutz
K Moreland
M Atzmueller
M Boley
S Garcia
S Wrobel
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 11/05/2020
Field of study

International audienceSubgroup discovery in labeled data is the task of discovering patterns in the description space of objects to find subsets of objects whose labels show an interesting distribution, for example the disproportionate representation of a label value. Discovering interesting subgroups in purely numerical data-attributes and target label-has received little attention so far. Existing methods make use of discretization methods that lead to a loss of information and suboptimal results. This is the case for the reference algorithm SD-Map*. We consider here the discovery of optimal subgroups according to an interestingness measure in purely numerical data. We leverage the concept of closed interval patterns and advanced enumeration and pruning techniques. The performances of our algorithm are studied empirically and its added-value w.r.t. SD-Map* is illustrated

Crossref

HAL Descartes

Hal-Diderot

Mining Formal Concepts using Implications between Items

Author: A Belfodil
A Gély
A Soulet
AV Aho
B Ganter
B Ganter
B Ganter
B Korte
BL Dietrich
DS Johnson
JP Bordat
M Boley
Peggy Cellier
R Wille
RE Tarjan
S Roman
SO Kuznetsov
SO Kuznetsov
SO Kuznetsov
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2019
Field of study

International audienceFormal Concept Analysis (FCA) provides a mathematical tool to analyze and discover concepts in Boolean datasets (i.e. Formal contexts). It does also provide a tool to analyze complex attributes by transforming them into Boolean ones (i.e. items) thanks to conceptual scaling. For instance, a numerical attribute whose values are {1, 2, 3} can be transformed to the set of items {≤ 1, ≤ 2, ≤ 3, ≥ 3, ≥ 2, ≥ 1} thanks to interordinal scaling. Such transformations allow us to use standard algorithms like Close-by-One (CbO) to look for concepts in complex datasets by leveraging a closure operator. However, these standard algorithms do not use the relationships between attributes to enumerate the concepts as for example the fact that ≤ 1 implies ≤ 2 and so on. For such, they can perform additional closure computations which substantially degrade their performance. We propose in this paper a generic algorithm, named CbOI for Close-by-One using Implications, to enumerate concepts in a formal context using the inherent implications between items provided as an input. We show that using the implications between items can reduce significantly the number of closure computations and hence the time effort spent to enumerate the whole set of concepts

Crossref

HAL

Hal-Diderot